Recombinator-<i>k</i>-Means: An Evolutionary Algorithm That Exploits <i>k</i>-Means++ for Recombination
نویسندگان
چکیده
We introduce an evolutionary algorithm called recombinator-$k$-means for optimizing the highly non-convex kmeans problem. Its defining feature is that its crossover step involves all members of current generation, stochastically recombining them with a repurposed variant $k$-means++ seeding algorithm. The recombination also uses reweighting mechanism realizes progressively sharper stochastic selection policy and ensures population eventually coalesces into single solution. compare this scheme state-of-the-art alternative, more standard genetic deterministic pairwise-nearest-neighbor elitist policy, which we provide augmented efficient implementation. Extensive tests on large challenging datasets (both synthetic real-word) show fixed sizes generally superior in terms optimization objective, at cost expensive step. When adjusting two algorithms to match their running times, find short times (augmented) method always superior, while longer will it and, most difficult examples, take over. conclude reweighted whole-population costly, but better escaping local minima. Moreover, algorithmically simpler general (it could be applied even $k$-medians or $k$-medoids, example). Our implementations are publicly available \href{https://github.com/carlobaldassi/RecombinatorKMeans.jl}{https://github.com/carlobaldassi/RecombinatorKMeans.jl}.
منابع مشابه
E-Means: An Evolutionary Clustering Algorithm
In this paper we propose a new evolutionary clustering algorithm named E-means. E-means is an Evolutionary extension of k-means algorithm that is composed by a revised k-means algorithm and an evolutionary approach to Gaussian mixture model, which estimates automatically the number of clusters and the optimal mean for each cluster. More specifically, the proposed Emeans algorithm defines an ent...
متن کاملSOSPD Controllers Tuning by Means of an Evolutionary Algorithm
The Proportional Integral Derivative (PID) controller is the most widely used industrial device to monitoring and controlling processes. There are numerous methods for estimating the controller parameters, in general, resolving particular cases. Current trends in parameter estimation minimize an integral performance criterion. Therefore, the calculation of the controller parameters is proposed ...
متن کاملPersistent K-Means: Stable Data Clustering Algorithm Based on K-Means Algorithm
Identifying clusters or clustering is an important aspect of data analysis. It is the task of grouping a set of objects in such a way those objects in the same group/cluster are more similar in some sense or another. It is a main task of exploratory data mining, and a common technique for statistical data analysis This paper proposed an improved version of K-Means algorithm, namely Persistent K...
متن کاملK+ Means : An Enhancement Over K-Means Clustering Algorithm
K-means (MacQueen, 1967) [1] is one of the simplest unsupervised learning algorithms that solve the well-known clustering problem. The procedure follows a simple and easy way to classify a given data set to a predefined, say K number of clusters. Determination of K is a difficult job and it is not known that which value of K can partition the objects as per our intuition. To overcome this probl...
متن کاملDesigning an Algorithm for Cancerous Tissue Segmentation Using Adaptive K-means Cluttering and Discrete Wavelet Transform
Background: Breast cancer is currently one of the leading causes of death among women worldwide. The diagnosis and separation of cancerous tumors in mammographic imagesrequire accuracy, experience and time, and it has always posed itself as a major challenge to the radiologists and physicians. Objective: This paper proposes a new algorithm which draws on discrete wavelet transform and adaptive ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Transactions on Evolutionary Computation
سال: 2022
ISSN: ['1941-0026', '1089-778X']
DOI: https://doi.org/10.1109/tevc.2022.3144134